A More Relaxed Model for Graph-Based Data Clustering: s-Plex Editing
نویسندگان
چکیده
We introduce the s-Plex Editing problem generalizing the well-studied Cluster Editing problem, both being NP-hard and both being motivated by graph-based data clustering. Instead of transforming a given graph by a minimum number of edge modifications into a disjoint union of cliques (Cluster Editing), the task in the case of s-Plex Editing is now to transform a graph into a disjoint union of so-called s-plexes. Herein, an s-plex denotes a vertex set inducing a (sub)graph where every vertex has edges to all but at most s vertices in the splex. Cliques are 1-plexes. The advantage of s-plexes for s ≥ 2 is that they allow to model a more relaxed cluster notion (s-plexes instead of cliques), which better reflects inaccuracies of the input data. We develop a provably efficient and effective preprocessing based on data reduction (yielding a so-called problem kernel), a forbidden subgraph characterization of s-plex cluster graphs, and a depth-bounded search tree which is used to find optimal edge modification sets. Altogether, this yields efficient algorithms in case of moderate numbers of edge modifications.
منابع مشابه
A More Relaxed Model for Graph-Based Data Clustering: s-Plex Cluster Editing
We introduce the s-Plex Cluster Editing problem as a generalization of the wellstudied Cluster Editing problem, both being NP-hard and both being motivated by graph-based data clustering. Instead of transforming a given graph by a minimum number of edge modifications into a disjoint union of cliques (this is Cluster Editing), the task in the case of s-Plex Cluster Editing is to transform a grap...
متن کاملParameterized Algorithmics for Network Analysis: Clustering & Querying
Preface This thesis summarizes some of my results on NP-hard graph problems that have applications in the areas of network clustering and querying. The research for obtaining these results was Forschungsgemeinschaft (DFG), as a researcher in the DFG project " Parameterized Algorithmics for Bioinformatics " (PABI, NI 369/7). I want to express my gratitude to Rolf Niedermeier for giving me the op...
متن کاملParameterized Algorithmics for Network Analysis: Clustering & Querying
Preface This thesis summarizes some of my results on NP-hard graph problems that have applications in the areas of network clustering and querying. The research for obtaining these results was Forschungsgemeinschaft (DFG), as a researcher in the DFG project " Parameterized Algorithmics for Bioinformatics " (PABI, NI 369/7). I want to express my gratitude to Rolf Niedermeier for giving me the op...
متن کاملGraph Clustering by Hierarchical Singular Value Decomposition with Selectable Range for Number of Clusters Members
Graphs have so many applications in real world problems. When we deal with huge volume of data, analyzing data is difficult or sometimes impossible. In big data problems, clustering data is a useful tool for data analysis. Singular value decomposition(SVD) is one of the best algorithms for clustering graph but we do not have any choice to select the number of clusters and the number of members ...
متن کاملA Fast Algorithm to Compute Maximum k-Plexes in Social Network Analysis
A clique model is one of the most important techniques on the cohesive subgraph detection; however, its applications are rather limited due to restrictive conditions of the model. Hence much research resorts to k-plex a graph in which any vertex is adjacent to all but at most k vertices which is a relaxation model of the clique. In this paper, we study the maximum k-plex problem and propose a f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009